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Abstract: 

This article aims to improving and drawing inferences about population 
characteristic estimation, some of mathematical methods were used in content 
of stock market data are collected from Amman stock exchange (ASE) using three 
methods; point, interval estimation and Wavelet transform (WT) combined with 
interval estimation. Point estimate can be ambiguous because it may or may not 
be close to the number actuality estimated. Themethodology is to compare 
between the point and interval estimations then the estimation has improved by 
combining WT with the interval estimation in order to reduce the error. The 
results show that (WT) with interval estimation is the best method, (SPSS) and 
mat lab 2010a have used in this study. 
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1- Introduction 

Inferential statistics concerned with drawing correct inferences about population 
characteristic (called parameter) based on sample statistics; drawn a sample 
from a population ; Calculated sample statistics on variable(X); and then attempt 
to make inference about variable(X) in the population from witch the sample 
drawn; Introduction to probability & statistical (Mendenhall et. al., 2012). 

Financial data sample is used to understand the performance and characteristics 
of the entire population. For example, all of the familiar stock market averages 
are samples designed to represent the broader stock market and indicate its 
performance return. For the domestic publicly-traded stock market, populated 
with at least 10,000 or more companies, the Dow Jones Industrial Average (DJIA) 
has just 30 representatives; the S&P 500 has 500. Yet these samples are taken as 
valid indicators of the broader population. It's important to understand the 
mechanics of sampling and estimating, particularly as they apply to financial 
variables, and have the insight to analysis the quality of research derived from 
sampling efforts. Therefore, the estimation accuracy and processes become a 
hot topics nowadays. Consequently, in this article will be improved in content of 
stock market data. Therefore. The novel technique in this article is in improving 
the estimation accuracy by combining WT with the interval estimation by 
reducing the bound of error. 

With regard to all those literature reviews, this study attempts to employ the 
proposed method to the daily stock market data from ASE. Three selected 
estimation models are used in the proposed method comparison to assess its 
performance. Experimental results show that the proposed method which is 
interval estimation with WT is superior to existing method in terms of some 
accuracy estimation error measure. Section 2 introduces the literature reviews of 
necessary used term. In section 3 the dataset will be presented with its statistical 
analysis. Whereas methods are used in this paper will be presented in section 4. 
Section 5 shows the results and discussion. Finally, in section 6 the conclusion 
will be presented. 

2- Literature Review 

(Raef & Bahrini, 2017) Processes and analyzes the technical efficiency of Islamic 
banks in the Middle East and North Africa region. For adjusting the estimation 
bias and making confidence intervals for the estimated efficiency scores at 
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desired levels of significance. (Jian, et, al., 2010) has proved that the point 
estimate can be confusing, because it may or may not be nearby to the quantity 
being estimated. Confidence Interval is one of the most suitable manners of 
quantifying uncertain due to sampling error. Therefore, the interval estimation is 
more preferred than the point estimation. (Filimonov, et al, 2017) have found 
that the interval estimates can suitable for the irritation parameters. The 
researcher test the method successfully on artificial price time series and on 
three well-known historical financial bubbles. (Lai & hang, 2017) studied 
maximum likelihood point estimates and confidence intervals depended on 
delta. The researcher organized model studies using both positive factor analysis 
and standard errors models, diverse methods can be the one that reduces best 
performance. (Sangnawakij & Niwitpong, 2016) inspect confidence intervals for 
the single coefficient of variation and the variance of coefficients of variation in 
the binary parameter exponential distributions^ Jinnah & Zhao , 2015) apply the 
realistic likelihood process to make suggestion on the bivariate subsistence job of 
paired failure times by estimating the subsistence job of cut time with the 
Kaplan-Meier estimator. (Helton, et al., 2017) studies the Analytical tools that 
are based on the parameter estimates using residual analysis and the Cook space 
for worldwide influence, and different alarm systems for local effect is offered to 
assess the act of the maximum likelihood estimators, and the predicting aptitude 
of the models is assessed by means of the old and density estimate evaluation 
methods. (Rada& Claudia, 2009) redirects joint maximum estimation and semi- 
parametric estimation of copula parameters in a bivariate t-copula.( 
Bruzda,2015) In the paper the researcher propose positive nonparametric 
estimators of random signals established on the wavelet transform. Consider 
stochastic signals rooted in white noise and abstractions with wavelet denoizing 
procedures using the non-decimated discrete wavelet transform and the 
awareness of wavelet scaling. The researcher assess properties of these 
estimators through extensive computer simulations and partially also 
analytically. Wavelet estimators of random signals have strong benefits over 
parametric maximum likelihood approaches as far as computational subjects are 
concerned, while at the same time they can compete with these approaches in 
rapports of precision of estimation in small samples. (RuiyanLuo&XinQi, 2015) 
This articleaim to transform the functional regression models to multiple linear 
regression models by means of the discrete wavelet transformation. When the 
number of analytical curves is huge, the multiple linear regression model 
typically has much better number of features than the sample size. The 
researcher apply correlation established sparse regression technique to the 
caused high dimensional regression model. The original feature of sparse 
technique is the researcher executesparsityconsequence on the way of the 
estimate of the coefficient coursein its place of the estimate itself, and only the 
direction of the estimate is determined by an optimization problem. The 
estimation reliability of the coefficient curve for the useful regression model is 
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obtained when both the sample size and the number of curves go to infinity. The 
effects of the separateexplanations are argued. Compare method with both 
functional regression methods and other wavelet grounded sparse regression 
approaches on together simulated data and four real data sets, with the cases of 
single and multiple predictive curves. The results indicate that sparse wavelet 
regression methods are enhanced in removing local features and method in this 
article has good predictive performances in all scenarios. 

(Yana, et a I, 2006) integrates identification method to consider the uncertainty 
effect on modal parameters for output-only system. The method is based on the 
time-frequency characteristics of the WT and the capabilities of the bootstrap 
distribution in statistical estimation. For the WT based identification method, the 
important issues related to identification accuracy such as modal separation, 
end-effect, associated with the parameter selection of wavelet function based on 
Shannon entropy, are given detailed investigations. (Donald, 2016) discuss the 
roots of the Allan variance trace back 50 years before to two seminal papers, one 
by (Allan, 1966) and the other by (Barnes, 1966). Since then, the Allan variance 
has played an important part in the description of high-performance time and 
occurrence standards. WT first rose in the initial 1980s in the geophysical 
literature, and the isolated WT developed projecting in the dawn 1980s in the 
signal processing literature. (Flandrin, 1992) briefly documented a connection 
between the Allan variance and WT based upon the Haar WT. (Percival and 
Guttorp, 1994) well-known that one general estimator of the Allan variance the 
maximal connection estimator can be interpreted in terms of a version of the 
DWT now widely referred to as themaximal overlap DWT (MODWT). 

As critically review this researchis differentfrom others because theresearchers 
gathered financial data from website of(ASE) about two stock market samples 
randomly in order to draw right inferences about population characteristic 
(mean, variance, stander deviation) throughcomputing point estimation 
andconfidence intervals usingsuitable formula then computing and comparing 
the result of one point estimation and confidence interval. Finally the best 
method will be combined with WT in order to improve the estimation accuracy. 

3- Dataset 

The researchergatheredhistorical time series data for the year 2016 from ASE 
about two sample banks (Bank ARBK Jordanian, Bank Jordan Commercial) the 
data randomly chosen to drawing correct inferences about population 
characteristic mean, variance, stander deviation) for more details about the 
dataset refer to the following links 

( http://www.exchanqe.io/ar/company historical/ARBKjhttp://www.exchange.jo 
/ar/company historical/JCBK) respectively. The following figure shows the 
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behavior of the data, whereas the table will show the statistical analysis of the 
data used. 


Table 1: numerical measures 
of center, spread, and 


Histogram 



Statistics table — 1 numerical 
measures of center, spread, and 
outliers 


MAX V ALUE 

N 

Valid 

24 

Missing 

O 

Mean 

$3.87 

Std. Error of Mean 

$0.50 

Std. Deviation 

$2.45 

Variance 

6.017 

Skewness 

O 

Std. Error of Skewness 

0.472 

Kurtosis 

-2.184 


Fig 1. Dataset Shape 


Some notes can be summarize about Fig. 1 and Table las: 

1- The skewness as the Table 1 the skew approximately (0.00) that mean the data 
distribution such as a normal distribution. 

2- Kurtosis tells how tall and sharp the central highest. 

3- Many statistics inferences require that a distribution be normal or nearly normal, 
a normal distribution has skewness and extra kurtosis of 0 , so if your distribution 
is close to those values then it is probably close to normal as in table- 1. That 
mean the dataset is normal distribution, and can predicting the max or min rang 
for value of stocks between 6.2132 to 6.3285$ Table 3 with 95% confidence 
interval for Mean (Westfall, et. al., 2014). 


4- Mathematical Models 

4.1. Point Estimation. 

The 95% margin of error estimated as (+ 1,96 * Jl) where S: standard deviation 
and n: sample size. 




4.2. IntervalEstimation 

One Population Mean can be estimated as (x 
to ( Mendenhall, et al. 2012) 



for more details refer 
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While in this study the independent sample for two population mean will be 
considered using the formula: 

|(xl- x 2) + 1.645* J~~ + j Where is: 


Table 2. Sample information 



sample-1 

sample-2 

Mean 

x 1 

* 2 

Variance 

sl A 2 

s2 A 2 

Sample size 

n 1 

n 2 

z a, 2 

z- value 

z- value 


• second interval estimate (confidence interval) provided an estimation range 
where in which true population parameter might fall in statistics, interval 
estimation is the habit of sample data to compute an interval of likely values of 
an unidentified population parameter, in difference to point estimation, which is 
a single number. (JerzyNeyman, 1937) investigates interval estimation as 
separate from point estimation. In doing so, he known that then-new work 
repeatingoutcomes in the form of an estimate plus-or-minus a standard 
deviation directed that interval estimation was really the 
problem statisticians really had in mind. After the authors have identified the 
interval estimation then some Scientific problems associated with interval 
estimation may be summarized as follows: 

1. When interval estimates are informed, they should have a commonly 
believedclarification in the technical community and more broadly. In this 
regard, credible intervals are held to be most readily assumed by the general 
public. Interval estimates derived from fuzzy logic have much more request- 
specific senses. 

2. For usuallyhappeningstates there should be sets of standard events that can be 
used, subject to the examination and soundness of any vitalexpectations. This 
relates for both confidence intervals and credible intervals. 

3. For more new situations there should be leadership on how interval estimates 
can be formulated. In this regard confidence intervals and credible intervals have 
a related standing but there are alterations. 

4. Credible intervals can readily agreement with previous information, while 
confidence intervals cannot. 

5. Confidence intervals are more flexible and can be used nearly in more states 
than credible intervals: one zone where credible intervals hurt in contrast is in 
trade with non-parametric models. (Jerzy Neyman, 1894 - 1981. Technical 
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Report No. 155) , Therefore, in the nest subsection the WT equation will be 
presented. 

4.3. Wavelet Transform 

Previous studies have used statistics, technical analysis, fundamental analysis, 
and linear regression to predict market direction [16]. However, price forecasting 
is generally conducted using technical analysis or fundamental analysis. Technical 
analysis concentrates on market action, while fundamental analysis concentrates 
on the forces of supply and demand that drive price movements. The basic 
assumption of this study is supported by studies of the insurance time series. To 
study the relations among the insurance time-series variables, this work presents 
a hybrid method that integrates a wavelet and the ARIMA based forecasting 
scheme. Fig. 1 shows the main procedures of this approach. Wavelet theory is 
applied for data preprocessing, since the representation of a wavelet can deal 
with the non-stationarity involved in the economic and financial time series [26], 
The key property of wavelets for economic analysis is decomposition by time 
scale. Economic and financial systems contain variables that operate on various 
time scales simultaneously; thus, the relations between variables may differ 
across time scales. One of the benefits of the wavelet approach is that it is 
flexible in handling highly irregular data series [24], This study applies the 
orthogonal wavelet transform as the main wavelet transform tool. A wavelet not 
only decomposes the data inters of times and frequency, but also significantly 
reduces the processing time. Let n denote the time series size, then the wavelet 
decomposition used in this study can be determined in O (n) time [28], The 
following framework will be summarized the research methodology for this 
article: 



Fig - 2 Research Methodology Shape 
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Wavelets theory is based on Fourier analysis, which represents any function as 
the sum of the sine and cosine functions. A wavelet is simply a function of time t 
that obeys a basic rule, known as the wavelet admissibility condition [17]: 

^ rk(/)L 

c v=\—j Ad f <cc 

0 J 

Where . (pitjr transform and a function of frequency f ,is the Fourie <p(f) Do 
The wavelet transform (WT) is a mathematical tool that can be applied to 
numerous applications, such as image analysis and signal processing. It was 
introduced to solve problems associated with the Fourier transform as they 
occur. This occurrence can take place when dealing with nonstationary signals, or 
when dealing with signals that are localized in time, space, or frequency. 
Depending on the normalization rules, there are two types of wavelets within a 
given function/family. Father wavelets describe the smooth and low-frequency 
parts of a signal, and mother wavelets describe the detailed and high-frequency 
components. In the following equations, (2a) represents the father wavelet and 
(2b) represents the mother wavelet, with j=l... J in the J-level wavelet 
decomposition: [14] 

</>j,k = 2-> n <l>(t-2 ] k/2’) 

( 2 ) 

(pj,k = 2~ ,n (pit - 2 J k 12’) 

Where J denotes the maximum scale sustainable by the number of data points 
and the two types of wavelets stated above, namely father wavelets and mother 
wavelets, and satisfies: 

| </>(t)dt = 1 and J <p(t)dt = 0 (3) 

Time series data, i.e., function f(t), is an input represented by wavelet analysis, 
and can be built up as a sequence of projections onto father and mother 
wavelets indexed by both {k}, k = {0, 1, 2,. . .} and by{S}=2 j , {j=l,2,3,. . .J}. 
Analyzing real discretely sampled data requires creating a lattice for making 
calculations. Mathematically, it is convenient to use a dyadic expansion, as 
shown in equation (3). The expansion coefficients are given by the projections: 

Sj, k = J (f)j, kf(t)dt, dj, k = | qoj , kf (t)dt, (4) 

The orthogonal wavelet series approximation to f (t) is defined by: 

Pit) =Y, S .h tyj, %)+ k <pj, k it ) _ !’ k( PJ ~!’ k it) +•••+M, kit) (5) 

Sj(t) = ^Sj,k0j,k(t) 

(6)The WT is used to calculate the 

Dj(t) = 2]dj,k<pj,k(t) 

coefficient of the wavelet series approximation in Eq. (5) for a discrete signal. 
Where Sj(t) and Dj(t) are introducing the smooth and details coefficients 
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respectively? The smooth coefficients dives the most important features of the 
data set and the details coefficients are used to detect the main features in the 
dataset. 

When the data pattern is very rough, the wavelet process is repeatedly applied. 
The aim of preprocessing is to minimize the Root Mean Squared Error (RMSE) 
between the signal before and after transformation. The noise in the original 
data can thus be removed. Importantly, the adaptive noise in the training 
pattern may reduce the risk of over fitting in training phase. Thus, we adopt WT 
twice for the preprocessing of training data in this study. 

5- Results and Discussion 

The result of comparing manually and computer software (s.p.s.s) between point 
estimation and Confidence intervals (C. I) estimation the researcher found that 
the point estimation give information about the population not correct and 
mostly give you signal number WhileConfidence intervals representation more 
exact about population parameter characteristics 


Point estimation. 

The 95% margin of error estimated as ( + 1.96 * 
the point estimation for Arabbank (max value) is: 



) According with table-1 


1.645 * ( ) = (- -516 , + .516) hence the 95% confidence interval 

X '0.09070 ' ' ' 

Vlf 


for 


00 


is from 


(-0. 516 to 0.516) $ per share. 


The researcher generate formula to extract two values; the lower confidence 
limit (LCL) and the upper confidence limit (UCL). 

Depended on table - 1, and take each sample independently ARBK BANK with 
(C.l) = .95 

x = 6.2708 Z m2 = 1.645 

S= 09070 n = 12 month 

, 0.09070 

6.2708 ± 1.645 *(—=—) = (6.2132, 6.3285) Hence, the 95% confidence 

V12 

interval for (p ) is from (6.2132 to 6.3285) $ per share. That result confirm with 
software (s.p.s.s) for one population. 
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From Table -1 SPSS - SOFTWARE 


Lower Bound LCL 

$6.2132 

Upper Bound UCL 

$6.3285 


95% Confidence Interval for Mean 

00 


If you compare the result one point (- .516 to .516) and confidence interval 
(6.2132 to 6.3285) you note the result in the and confidence interval is more 
accuracy from one point and take widely rang and know more about population 
characteristic, Note: when used law z- table or law t - table the results is similar. 

sample independent 

According in table - 1 the researcher extract C. I (6.2708 - 1.4717) + 1.645 * ( 
f —-) = (4.71908, 4.87926) Hence, the 95% confidence interval for ( u ) 
is from 4. 71908 to 4.87926 $ per share. 



Table-1 SPSS - SOFTWARE 


95% Confidence Interval for 
Mean 

00 


Lower Bound LCL 

$4.71908 

Upper Bound UCL 

$4.87926 


Std. Error 

point 

estimation 

interval 

estimation 

Wavelet 

Transform 

Std. Error for one sample 
(Arab bank) 

0.03 Arab 

bank 

0.125 

0.3*10 2 

S .d (Std. Deviation) for 
one sample 

0.09 

0.09 


Std. Error for two sample 


2.45 


S .d (Std. Deviation) for 
two sample 


.50 

0.08*10 3 


Conclusion 

In this article has discussed the point and interval estimations in more details in 
content of stock market data by drawing correct inferences about population 
characteristic based on point estimation and confidence intervals estimation. In 
doing this, population characteristic (mean, variance, stander deviation) using 
interval confidence is more accuracy thane point estimation. After the 
researchers have approve this results then a Novel contribution has approved 
also by improving the forecasting accuracy through combining the WT with 
confidence interval estimation processes. 
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